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Within  (even  yean  after  the  birth  of  structural  chemistry  in  1861,  «>tually  all  of 
the  main  ideas  relating  to  line-formula  conventions  were  devised  ana  published 
in  the  leading  chemical  journals  of  a  century  ago.  No  basically  new  practices  appeared 
for  some  79  yean.  Then,  within  on  identically  brief  period  of  just  seven  yean 
(1947-1934),  virtually  all  of  the  fundamental  features  of  structure-delineating  chemical 
notations  appeared  in  the  international  chemical  literature.  The  key  characteristics 
of  the  old  conventions  and  new  systems  are  surveyed. 


“The  recent  international  interest  in  chemical  notation 
has  made  it  seem  profitable  and  desirable  to  examine 
the  historical  records  for  a  guiding  background."  This 
was  the  opening  remark  for  a  report  on  “The  History 
of  Chemical  Notation”  presented  18  years  ago  at  the 
118th  Meeting  of  the  American  Chemical  Society  in 
Chicago.'  That  report  in  turn  quoted  an  opening  remark 
or.  "Chemical  Notation  and  Nomenclature"  by  Samuel 
William  Johnson  that  has  an  amusing  echo  today  because 
his  remark  now  is  nearly  100  years  old:' 

“Beginners  in  Chemistry  are  liable  to  much  confusion  and 
embarrassment  from  the  fact  that  there  are  now  in  use  two 

*  I’rmefUetl  hefnrr  thr  Division  of  Chemical  Symposium  on  Notation 
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distinct  systems  of  Chemical  Notation  and  several  forms  of 
nomenclature." 

Johnson  was  referring  to  notation  developments  that  were 
then  just  10  years  old.  More  recent  details  on  the  "Origin 
of  the  Line- Formula  Method"  were  given  in  1954. '  and 
these  historic  developments  again  were  reported  to  the 
American  Chemical  Society  in  1962,  this  time  with  a 
century-old  perspective.' 

The  key  idea  of  structural  chemistry  was  popularized 
in  1860,  when  the  leading  chemists  of  the  world  attended 
the  first  International  Chemical  Congress  at  Karlsruhe' 
to  resolve  their  confusions  about  atoms,  molecules,  and 
equivalents.  At  the  close  of  this  4-day  session,  Stanislao 
Cannizzaro  clarified  the  conrept  of  molecules  with  his 
reprints  on  “the  message  of  his  old  teacher,  Avogadro, 
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who  was  then  dead  and  forgot  ten."'  Alexander  Butlerov 
thereupon  predicted  that  the  future  task  of  the  chemist 
was  to  determine  the  atomic  arrangements  in  these  mole¬ 
cules.  He  was  the  that  to  use  the  word  “structure"  in 
its  modern  chemical  meaning. 

THE  LINE-FORMULA  CONVENTION 

Ever  since  this  lirst  recognition  of  structural  importance, 
chemists  have  identified  the  molecular  configurations  in 
text  copy  simply  by  delineating  the  symbols  for  the  corre¬ 
sponding  molecular  segments,  one  after  another  as  connect¬ 
ed.  These  are  the  “line-formula”  descriptions 

The  exact  convention  relating  to  "linear  expression  of 
formulae"  was  clearly  defined  by  A.  D.  Mitchell  20  years 
ago:"  “Each  full  point  (or  period  mark)  is  regarded  as 
separating  two  atoms  which  are  directly  linked  in  the 
main  chain  of  a  compound,  and  atoms  or  grcups  attached 
to  each  of  these  atoms  are  written  immediately  after 
it  and  before  the  next  full  point.” 

Josef  Loschmidt  led  this  new  era  of  structural  chemistry 
with  his  publication  of  Chemische  Studien  (Vienna,  1861), 
which  contained  368  remarkably  astute  graphical 
diagrams,  including  scores  of  benzene  ring  postulations’1 
(four  years  before  Kekule  “introduced”  this  ring  to  the 
chemical  world).  His  text  included  a  few  “ratinnelle 
I'onnel’  such  as  C:H,,0,CH.  tor  (’.her,  C_H',,(J,CH  for 
methyl  ethyl  ether,  and  C;H-„0,C>Hl,0,C;H.s  for  the 
diethyl  ether  of  ethylene  glycol.  In  1863,  L.  Carius  showed 
o  few  other  comma-separated,  line-formula  descriptions 
like  CHHH.O.H  and  C.H-,,0,H,"’  and  Emil  Erlenmcyer 
omitted  the  punctuation  with  familiar  forms  like  CiH;OH, 
C,H:I  and  C1H-CC1H7."  Meanwhile  H.  Hubner  had  used 
the  conventional  period  punctuation  in  the  first  line- 
formula  descriptions  to  appear  in  a  chemical  journal,1' 
with  his  discussion  of  the  CH-CN.CO.Br  and 
CH.Br.CO.CN  isomers. 

In  1866,  August  Kekul6,"  Henry  Debus,'*  H.  L.  Buff,1’ 
Erlenmcyer,'"  and  E.  Frankland  and  B.  F,  Duppa1’  com¬ 
pleted  the  popularization  of  this  line-formula  technique 
with  scores  rf  examples  like  CHi.CO.OH,  CHi.CO.CHi, 
NC — CHj—COiH,  C«Hs— CH; — Br,  and  C*H  .S0,.0H. 
Debus  described  ethylene  simply  as  CH..CH.  and  acetyl¬ 
ene  as  CH.CH;  the  unsaturation  mark  that  Erlenmcyer 
introduced  at  this  same  time  was  picked  up  two  yea: ? 
later  by  C.  Graebe'"  in  his  CtH-.-  N  =N— CsH-,  descript.on 
of  azobenzene,  and  by  A.  W.  Hofmann1'*  in  his  explanation 
of  — N=C=S  and  -S-CsN  isorners.  Finally,  when 
H.  Wichelhaus  introduced  the  word  “valenz”  (as  a 
simplification  of  equivalence  or  quantivalence)  in  1867, 
he  suggested  the  punctuating  refinement  of  periods  to 
set  off  “side-group”  terminations  and  hyphens  to  show 
directly  connected  groups,  as  in  his  C0.0H-CH2-C0.0H 
and  CHi-CH.0H-C0.0H.JI  Thus,  within  the  short  period 
of  just  seven  years  after  the  birth  of  structural  chemistry 
in  1861,  virtually  all  of  the  main  ideas  relating  to  line- 
formula  descriptions,  were  conceived  and  published.  No 
basically  new  practices  appeared  foi  some  79  years.  Then, 
within  an  identically  brief  period  of  just  seven  years  (1947- 
1954),  virtually  all  of  the  fundamental  features  of 
structure-delineating  chemical  notations  appeared  in  the 
international  chemical  literature. 

Examples  of  these  early  variations  of  "rational  formulas” 
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were  given  with  explanatory  comments  in  two  other  recent 
reports  on  chemical  structure  information  processing; 
hence,  they  need  not  he  repeated  here.'1  *" 

(!.  Malcolm  Dyson  inspired  tl.e  modern  development 
of  chemical  notations  with  his  1947  manual  on  “A  New 
Notation  and  Enumeration  System  for  Organic  Gam- 
pounds.”''  An  intriguing  feature  about  these  first  Dyson 
ciphers  was  that  they  were  restricted  to  capital  letters, 
on-line  numerals,  and  10  punctuation  marks.  A  single 
letter  represented  the  benzene  ring,  and  five  other  letters 
represented  similar  aromatic  skeletons  that  were  used  as 
building  blocks  for  higher  elaborations  of  polycyclic  sys¬ 
tems.  in  addition  to  his  manuals,  Dyson  presented  a  num¬ 
ber  of  papers  on  his  notation. ‘‘  “ 

F.  L.  Taylor  in  1947  developed  a  mathematically  elegant 
method  for  enumerating  topologically  similar  positions  in 
all  ring  systems  —  carbocyclic  or  heterocyclic,  aromatic  or 
saturated.'*  Dyson’s  greatly  enlarged  second  edition 
incorporated  this  Taylor  system  of  ring  enumeration  after 
A.  M.  Patterson  pointed  out  the  "Possibilities  for  a  Com¬ 
bined  System  of  Notation  and  Nomenclature  for  Organic 
Compounds."1' 

The  G-K-D  ciphers  of  Gordon,  Kendall,  and  Davison 
appeared  in  1947  and  1948  with  their  publications  on 
“A  New  Svstemizat'on  of  Chemical  Species”  and  "Chem¬ 
ical  Ciphering:  A  Universal  Code  as  an  Aid  to  Chemical 
Systematics.  At  that  time,  all  three  were  employed 
in  the  research  department  of  the  Dunlop  tire  firm  in 
Birmingham,  England.  They  evidently  designed  their  sys¬ 
tem  for  early  versions  of  computers,  for  Davison  shortly 
thereafter  reported  on  “Programs  ui.d  Equipment  for 
Sorting  Gordon-Kendall-Davison  Punched  Cards  for  Any 
Structurally  Defined  Group.”'"  Again  in  1957,  Davison 
and  Gordon  reported  on  "Sorting  of  Chemical  Groups 
Using  Gordon-Kendall-Davison  Ciphers.”*'  Dcvison  and 
Dyson’s  critical  views  of  each  other’s  approaches  were 
aired  in  the  Letters  column  of  Chrmistry  &  Industry  in 
1954." 

The  Gruber  Notation  was  first  reported  in  1949,“  and 
for  a  time  attracted  the  interest  of  examiners  looking 
for  a  suitable  international  chemical  notation.  Dr.  Gruber 
pleaded  with  the  IUPAC  representatives,  before  and  after 
their  Stockholm  meeting  in  1951,  not  to  he  hasty  in  their 
selection  of  an  internationally  recommended  notation.  He 
provided  some  stimulating  suggestions  for  classification 
prefixes  and  prime  marks  for  another  notation,*'  but  did 
not  pursue  further  development  of  his  own.  In  February 
of  this  year,  he  responded  to  our  news  about  this  Notation 
Symposium  with  his  best  wishes,  and  informed  us  that 
he  will  be  82  years  old  in  June! 

In  1949,  the  IUPAC  Commission  on  Codification, 
Ciphering,  and  Punched  Card  Techniques  announced  its 
interest  in  seeking  on  internationally  suitable  chemical 
notation  and  invited  designers  to  submit  their  proposals 
for  review  in  1951.  It  enumerated  11  desirable  charac¬ 
teristics  or  "desiderata  for  an  internationally  acceptable 
chemical  notation.”*'  These  were:  simplicity  of  use.  ease 
of  printing  and  typewriting,  conciseness,  rerognizability, 
ability  to  generate  a  unique  chemical  nomenclature,  com¬ 
patibility  with  the  accepted  practices  of  inorganic  chemical 
nomenclature,  uniqueness,  generation  of  an  unambiguous 
and  useful  enumeration  pattern,  ease  of  manipulation  by 
machine  methods  for  example,  by  punched  cards,  exliibi- 
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lion  'il  associations  (dcscript i venessl  ami  ability  to  deal 
with  partial  indeterminates. 

The  six  essential  qualities  that  Dyson  cited  at  this 
same  time  seem  more  t  -  the  point,  in  consideration  of 
all  the  subsequent  developments  away  from  chemical 
nomenclature:  First  he  cited  conciseness,  then  linear 
expression,  uniqueness,  adaptability  for  mechanical  mani¬ 
pulation,  general  simplicity,  and  recogniznhility.1’ 

John  A,  Silk  responded  to  the  IUPAC  Commission’s 
invitation  with  his  "New  System  of  Organic  Notation,” 
distributed  privately  in  19M.  More  than  a  decade  later, 
he  published  additional  details  and  improvements  in  his 
"Linear  Notation  for  Organic  Compounds."414'  Silk  had 
been  inspired  by  Dyson’s  London  lecture  of  1946.  In 
1949,  he  wrote  an  article  reviewing  methods  fir  ring  sys¬ 
tems,  but  did  not  publish  it.  His  interest  again  returned 
in  1957  when  he  served  in  an  101  company  committee 
to  review  and  comment  on  the  tentative  draft  of  the 
lUPAC-Dvson  notation.  He  got  busy  again  during  the 
winter  of  19(50  (51  to  write  the  papers  published  in  the 
■Journal  nf  Chemical  Documentation.  J.  (1.  Coekburn,  a 
member  of  the  British  Chemical  Abstracts  staff,  responded 
at  the  same  time  (1951)  with  his  “Newcastle  System." 
Silk  met  him  around  that  time  and  recalls  that  Coekburn 
devised  his  system  more  as  a  chemical  shorthand  than 
a  systematic  notation  and  was  not  proposing  to  develop 
it. 

In  August  1951,  these  linear  notations  of  Dyson,  Cordon, 
Kendall  and  DavL.on,  Gruber,  Silk  and  Coekburn  were 
hriellv  reviewed  by  the  IUPAC  Commission  at  Mas¬ 
sachusetts  Institute  of  Technology,  along  with  a  notation 
that  was  based  on  the  “Principle  of  Least  Effort."41'  This 
notation,  lirsl  described  in  1950,  later  was  reported  in 
Chemical  and  Kn/timering  News,'  in  the  British  ASLIB 
Proceedings,4"  and  finally  was  published  as  a  small 
manual.4 

F.  IL  Benson  meanwhile  had  shown  how  this  American 
notation  could  be  used  for  “Recording  and  Recovering 
Chemical  Information  with  Standard  Tabulating 
Kquipment”";  and  at  this  same  ACS  Meeting  in  1953, 
the  author  described  tabulating  applications  in  n  toxicity 
registry."’ 

K.  G.  Smith  started  using  this  notation  with  standard 
punched-enrd  equipment  in  1952  and  in  1954  wrote  a 
faculty  report  on  its  use  in  “A  Punched  Card  Catalog 
of  the  Physical  Properties  of  Some  Common  Organic 
Chemicals.”  In  19(50,  he  reported  its  attractions  in 
“Machine  Searching  for  Chemical  Structures.”"1  He  was 
s  i  encouraged  with  his  experiences  in  encoding  some  50,000 
structures  that  he  volunteered  to  start  revising  the  1954 
manual.  After  seven  years  of  hard  labor  laboring  mainly 
with  a  control"  ’g  committee  of  notation  users-  he  com¬ 
pleted  a  manual  that  incorporates  a  number  of  his  own 
ideas  and  provides  far  more  comprehensive  coverage  than 
he  1954  edition.  This  new  manual”  is  published  by  the 
McGraw-Hill  Book  Company  ar:d  is  available  with  addi¬ 
tional  computer-generated  services  from  the  Institute  for 
Scientific  Information.” 

H.  1'.  Bonnett  started  studying  this  notation  in 
February  1953,  after  his  interest  was  aroused  by  F.  R. 
Benson.  Bonnett  and  D.  W.  Calhoun  in  19C1  described 
the  “Application  of  a  Line  Formula  Notation  in  an  Index 
of  Chemical  Structures”1:  and  at  the  same  time,  Gelberg, 
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Nelson,  Yee,  and  Metcalf  reported  on  “A  Program 
Retrieval  of  Organic  Structure  Information  via  Punched 
Cards.”"  Both  groups  were  collaborating  with  E.  G.  Smith, 
using  the  notation  that  the  author  called  to  the  attention 
of  the  ACS  Division  of  Chemical  Education  as  a  “New 
Tool  for  Teaching  Structural  Chemistry”  at  this  same 
national  meeting  of  ACS/"’ 

Alan  Gelberg  decided  to  experiment  with  this  "least 
effort”  notation  in  December  1959  at  the  Industri  il  Liaison 
Office  of  Edgewood  Aisenal.  He  was  instrumental  in 
demonstrating  the  capability  of  the  notation  and  the  power 
of  its  permuted  indexing  to  a  number  of  other  subsequent 
users. 

H.  Winston  Haywatd  in  November  1961  announced 
a  “New  Sequential  Enumeration  and  Line  Formula  Nota¬ 
tion  System  for  Organic  Compounds.”'"  By  this  time  the 
IUPAC  Commission  on  Codification,  Ciphering,  and 
Punched  Card  Techniques  had  adopted  the  Dyson  nota¬ 
tion  as  the  basis  for  a  provisional  international  notation, 
had  published  a  tentative  version  in  1958,“  and  a  final 
version  in  1961.’'1  Hayward  later  reported  on  some  experi¬ 
ences  with  his  notation  in  a  summer  training  project,'"’ 
and  additional  developments  are  included  in  this 
symposium.*1  Dyson  also  will  be  reporting  on 
“Modifications  and  Abbreviations  Recommended  for  Com¬ 
puter  and  Visual  Handling  of  the  IUPAC  Notation”  in 
the  next  paper,"'  followed  by  H.  F.  Dammers  and  D. 
J.  Polton’s  “Use  of  the  IUPAC  Notation  in  Computer 
Processing  of  Information  on  Chemical  Structures.”*1 
Joshua  Lederberg,  in  Stanford  University’s  School  of 
Medicine  in  Palo  Alto,  introduced  a  “Notational  Algorithm 
for  Tree  Structures”  called  DENDRAL-64  (dendritic 
o/gnrithm.  1961),  "A  System  for  Computer  Construction, 
Enurriernl  ion  and  Notation  of  Organic  Molecules  as  Tree 
Structures  and  Cyclic  Graphs.”04'*'  Part  II,  “Topology 
of  Cyclic  Graphs,”  appenred  a  year  later1*’  and  his  Sys¬ 
tematica  ofOrg=:.«  Molecules,  Gr^oh  Topology  and  Ham¬ 
iltonian  Circuits,  A  General  Outline  of  the  DENDRAL 
System”  issued  in  1966.  Unique  DENDRAL  notations 
can  be  generated  from  the  nonunique  synonyms  through 
Lederberg’s  computer  programs. 

No  new  a  d  unique  chemical  notation  system  has  come 
to  our  attention  since  1964,  when  the  “Survey  of  Chemical 
Notation  Systems”  was  published.41  Meanwhile  Sorter, 
Granito,  Gelberg,  and  their  former  associates  at  Edgcwooci 
Arsenal  reported  on  encyclopedic  notation  indexes  made 
by  “permuting"  the  symbols  of  our  line  notation  with 
computer  programs.1"  09 

F.  A.  Landee  in  1964  also  reported  on  valuable  computer 
applications,  such  as  a  “Checker”  program  that  calculates 
a  molecular  formula  from  the  WLN  and  prints  out  the 
discrepancies  when  this  does  not  agree  with  the  input 
formula.  1  Errors  are  about  equally  divided  between  nota¬ 
tions  and  formulas!  Landee  carried  these  notation  ideas 
to  Moscow  in  1965,  in  a  paper  on  “Computer  Methods 
of  Handling  Files  of  Chemically  Oiiented  Information.'1 
Other  impressive  notation-programming  developments 
from  his  Computations  Research  Laboratory  were  reported 
elsewhere”  and  in  this  symposium.71 

Gibson,  Granito,  Renard,  and  Metcalf  in  1965  provided 
a  brief  introduction  to  “  The  Wiswesser  Line-Notation,” 
principally  for  chemistry  students  and  teachers.'4  A  more 
comprehensive  introduction  to  the  various  applications 
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i>l  this  WLN  was  presented  at  Kd^ewood  Arsenal  in 
()ctol>er  196(5,  and  these  proceedings  have  just  been 
published."  Related  reports  o  i  joint  studies  at  J.  T.  Baker 
Chemical  Company  and  Fort  Detrick  were  published  else¬ 
where  in  1966." 

J.  K.  Horner  contributed  to  this  Kdgewood  Arsenal 
Conference  with  his  separately  published  report'  on  “Low- 
Cost  Sto.age  and  Retrieval  of  Organic  Structures  by  Per¬ 
muted  Line  Notations:  Small  Collections."  The  presenta¬ 
tions  by  Ernest  Hyde  on  “A  Computer-Generated  Opon- 
Rnded  Fragment  Code"  and  Lucille  Thomson  or  “Struc¬ 
ture  Display"  also  were  elaborated  elsewhere.  ' 

This  concludes  the  brief  and  incomplete  historic  review 
of  chemical  notation  developments  that  were  published 
during  the  past  107  years.  Other  more  impressive  develop¬ 
ments  reported  at  this  San  Francisco  meeting,  such  as 
those  just  announced  by  the  Institute  for  Scientific 
Information,  certainly  should  assure  all  observers  that 
chemical  notations  are  here  to  stay  as  long  as  chemists 
continue  to  read  and  write  about  the  substances  *hey 
are  creating  and  studying. 
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